Blind Audiovisual Source Separation Using Sparse Redundant Representations

نویسندگان

  • Anna Llagostera Casanovas
  • Gianluca Monaci
  • Pierre Vandergheynst
  • Anna Llagostera
چکیده

In this work, we present a method that jointly separates active audio and visual structures on a given mixture. This new concept, the Blind Audiovisual Source Separation (BAVSS), is achieved by exploiting the coherence existing between the recorded signal of a video camera and only one microphone. An efficient representation of audio and video sequences allows to build robust audiovisual relationships between temporally correlated structures of both modalities or, what turns to be the same, two parts of the same audiovisual event. First, video sources are localized and separated on the image sequence exploiting the temporal occurrence of audiovisual events and using a spatial clustering algorithm, without necessity of any previous assumption about the number of sources in the mixture. Second, the same audiovisual relationships together with a time-frequency probabilistic analysis allow the separation of the audio sources in the soundtrack, and, consequently, the complete Audiovisual Separation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Blind Source Separation: the Sparsity Revolution

Over the last few years, the development of multi-channel sensors motivated interest in methods for the coherent processing of multivariate data. Some specific issues have already been addressed as testified by the wide literature on the so-called blind source separation (BSS) problem. In this context, as clearly emphasized by previous work, it is fundamental that the sources to be retrieved pr...

متن کامل

Blind Separation of Sources in functional MRI sequences

Functional Magnetic Resonance Imaging (fMRI) is an important and popular tool for studying the human brain activity. In most fMRI scans, the BOLD technique is used, producing an image of the blood oxygenation level throughout the brain. High oxygenation levels represent high activity of brain regions responsible for performance of a specific task. The process can be modeled as a linear mixture ...

متن کامل

Optimal Sparse Representations for Blind Deconvolution of Images

The relative Newton algorithm, previously proposed for quasi maximum likelihood blind source separation and blind deconvolution of one-dimensional signals is generalized for blind deconvolution of images. Smooth approximation of the absolute value is used in modelling the log probability density function, which is suitable for sparse sources. We propose a method of sparsification, which allows ...

متن کامل

Using Joint Sparsity for Blind Separation of Noisy Multichannel Signals

We call a set of vectors z[k] ∈ R jointly sparse, when for the most of them all m components are simultaneously [close to] zero. When recovering this set from [indirect] noisy observations using variational approach, joint sparsity prior can be expressed via convex penalty term ∑ k ‖z[k]‖2. In this work we explore joint sparsity in the context of blind source separation problem X = AS + ξ, wher...

متن کامل

Underdetermined Anechoic Blind Source Separation

In this paper, we address the problem of under-determined Blind Source Separation (BSS) of anechoic speech mixtures. We propose a demixing algorithm that exploits the sparsity of certain time-frequency expansions of speech signals. Our algorithm merges `-basis-pursuit with ideas based on the degenerate unmixing estimation technique (DUET) [1]. There are two main novel components to our approach...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006